A Semantic Kernel to Exploit Linguistic Knowledge
نویسندگان
چکیده
Improving accuracy in Information Retrieval tasks via semantic information is a complex problem characterized by three main aspects: the document representation model, the similarity estimation metric and the inductive algorithm. In this paper an original kernel function sensitive to external semantic knowledge is defined as a document similarity model. This semantic kernel was tested over a text categorization task, under critical learning conditions (i.e. poor training data). The results of cross-validation experiments suggest that the proposed kernel function can be used as a general model of document similarity for IR
منابع مشابه
Chinese Hedge Scope Detection Based on Structure and Semantic Information
Hedge detection aims to distinguish factual and uncertain information, which is important in information extraction. The task of hedge detection contains two subtasks: identifying hedge cues and detecting their linguistic scopes. Hedge scope detection is dependent on syntactic and semantic information. Previous researches usually use lexical and syntactic information and ignore deep semantic in...
متن کاملM ODELS by Tong Wang A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy
Exploiting Linguistic Knowledge in Lexical and Compositional Semantic Models Tong Wang Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2016 A fundamental principle in distributional semantic models is to use similarity in linguistic environment as a proxy for similarity in meaning. Known as the distributional hypothesis, the principle has been successfully app...
متن کاملInduction of Classifiers through Non-Parametric Methods for Approximate Classification and Retrieval with Ontologies
This work concerns non-parametric approaches for statistical learning applied to the standard knowledge representations languages adopted in the Semantic Web context. We present methods based on epistemic inference that are able to elicit and exploit the semantic similarity of individuals in OWL knowledge bases. Specifically, a totally semantic and language independent semi-distance function is...
متن کاملThe Manifestation Challenge: The Debate between McDowell and Wright
In this paper, we will discuss what is called “Manifestation Challenge” to semantic realism, which was originally developed by Michael Dummett and has been further refined by Crispin Wright. According to this challenge, semantic realism has to meet the requirement that knowledge of meaning must be publically manifested in linguistic behaviour. In this regard, we will introduce and evaluate John...
متن کاملLinguagrid: a network of Linguistic and Semantic Services for the Italian Language
In order to handle the increasing amount of textual information today available on the web and exploit the knowledge latent in this mass of unstructured data, a wide variety of linguistic knowledge and resources (Language Identification, Morphological Analysis, Entity Extraction, etc.). is crucial. In the last decade LRaas (Language Resource as a Service) emerged as a novel paradigm for publish...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005